Research on a Lip Reading Algorithm Based on Efficient-GhostNet

نویسندگان

چکیده

Lip reading technology refers to the analysis of visual information speaker’s mouth movements recognize content speech. As one important aspects human–computer interaction, lip has gradually become popular with development deep learning in recent years. At present, most networks are very complex, large numbers parameters and computation, model generated by training needs occupy memory, which brings difficulties for devices limited storage capacity computation power, such as mobile terminals. Based on above problems, this paper optimizes improves GhostNet, a lightweight network, it proposing more efficient Efficient-GhostNet, achieves performance improvement while reducing number through local cross-channel interaction strategy, without dimensionality reduction. The improved Efficient-GhostNet is used perform spatial feature extraction, then extracted features inputted GRU network obtain temporal sequences, finally prediction. We Asian volunteers recording dataset paper, also adopting data enhancement dataset, using angle transformation deflect process recorder 15 degrees each left right, order be able enhance robustness better reduce influence other factors, well improve generalization ability so that can consistent recognition scenarios real life. Experiments prove + achieve purpose comparable accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Efficient Lip-reading Method Using K-nearest Neighbor Algorithm

Many studies have been carried out on lip reading, most of those works are based on color images, while some essential features might not be obtained, like inner lip information. In this paper, RGBD camera will be introduced for improving the recognition rate of lip reading. We try to complete lip reading through using only gray-scale images. Thirteen groups of words are given, and we present e...

متن کامل

Lip-reading based on a fully automatic statistical model

In this paper, we describe audiovisual automatic speech recognition experiments carried using visual parameters extracted from “natural” images. Unlike many other experiments in the AV ASR field, these visual parameters are obtained without any hand-labeling phase and are naturally noisy, due to the extraction process. We evaluate our models with different strategies among which : use of a shap...

متن کامل

the effect of genre-based teaching on reading comprehension of literary texts

تحقیق حاضر به بررسی کاربرد روش ژانر-محور را در محیط آموزش زبان عمومی می پردازد.روش ژانر-محور به زبان آموزان کمک میکند که در زمینه خوانش پیشرفت کنند. بعضی از محققین معتقد اند که روش تدریس ژانر-محور به تدریج به زبان آموزان کمک می کند تا در درک ژانر های مختلف مهارت یابند (هایلند 2004).همچنین امروزه توجه روز افزونی به اهمیت استفاده از ادبیات در برنامه آموزشی زبان انگلیسی (esl/efl ) شده است. زمانی ک...

15 صفحه اول

Research on Color Watermarking Algorithm Based on RDWT-SVD

In this paper, a color image watermarking algorithm based on Redundant Discrete Wavelet Transform (RDWT) and Singular Value Decomposition (SVD) is proposed. The new algorithm selects blue component of a color image to carry the watermark information since the Human Visual System (HVS) is least sensitive to it. To increase the robustness especially towards affine attacks, RDWT is adopted for its...

متن کامل

a study on the effectiveness of textual modification on the improvement of iranian upper-intermediate efl learners’ reading comprehension

این پژوهش به منظور بررسی تأثیر اصلاح متنی بر بهبود توانایی درک مطلب زبان آموزان ایرانی بالاتر از سطح میانی انجام پذیرفت .بدین منظور 115 دانشجوی مرد و زن رشته مترجمی زبان انگلیسی در این پزوهش شرکت نمودند.

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2023

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics12051151